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Abstract 

The aim of this paper is to show a connection between an extended theory of 
statistical experiments on the one hand and the foundation of quantum theory 
on the other hand. The main aspects of this extension are: One assumes a 
hyperparameter space <i> common to several potential experiments, and a basic 
symmetry group G associated with this space. The parameter 9a of a single 
experiment, looked upon as a parametric function Oa{-) on $, is said to be 
permissible if G induces in a natural way a new group on the image space of 
the function. If this is not the case, it is arranged for by changing from G to 
a subgroup Ga- The Haar measure of this subgroup (confined to the spectrum; 
see below) is the prefered prior when the parameter is unknown. It is assumed 
that the hyperparameter itself can never be estimated, only a set of parametric 
functions. Model reduction is made by restricting the space of complex 'wave' 
functions, also regarded as functions on <i>, to an irreducible invariant subspace 
M. under G. The spectrum of a parametric function is defined from natural 
group-theoretical and statistical considerations. We establish that a unique 
operator can be associated with every parametric functions da{-), and essentially 
all of the ordinary quantum theory formalism can be retrieved from these and 
a few related assumptions. In particular the concept of spectrum is consistent. 
The scope of the theory is illustrated on the one hand by the example of a 
spin 1/2 particle and a related EPR discussion, on the other hand by a simple 
macroscopic example. 
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1 Introduction 



One general motivation behind this paper departs from the observation that several 
cultures exist, also in science, even related to something as objective as the use of 
quantitative methodology. A possible way to approach this difficulty may be to 
aim at some kind of partial synthesis between methodology traditions that have 
developed independently of each other, in the present case statistical theory and 
quantum theory. Such endeavour must necessarily have a long-term perspective, so, 
although several concrete results are obtained here - even to the extent that some 
sort of synthesis of the two disciplines may seem to be in sight - a solution to the 
basic philosophical problem of finding a general way to stay impartial in science 
can of course in no way be promised. Nevertheless, there is a hope that something 
general can in fact be learned from an investigation like this. 

In Hclland (1998a) it was argued tentatively that the basic principles of quantum 
mechanics in its usual Hilbert space formulation is consistent with an ordinary gen- 
eral statistical modelling framework which includes the following features: 1) For a 
given system one has the choice between several experiments that can be performed. 
2) There is an unobscrvablc hypcrparameter (j) which is common to all potential 
experiments. 3) There is a symmetry group G acting on $ = {^}. (In Helland, 
1998a, the notation G was used, since G was reserved for the corresponding sample 
space group; in the present paper we will concentrate on parameter transformations, 
however, so we change notation.) 4) The functions (</>)> giving the parameter of 
experiment a, satisfy a natural permissibility condition as a function of the hyper- 
parameter (f), namely: 9a{(pi) = 6a{<p2) implies 6a{g<t>i) = OaigH) for all g. 

We will show in this paper that in order to derive a version covering ordinary 
quantum mechanical systems from these assumptions, one has to include the case 
when the permissibility condition is not with respect to G, but only with respect to 
a subgroup Ga of G. In each experiment, we let Ga be the maximal subgroup for 
which 9a is permissible. In the case where is unknown, this is expressed relative 
to each experiment a, namely by taking the Haar measure relative to Ga as a prior. 

For statisticians it should be mentioned that even if the word prior is used here, 
we will not enter any discussion on Bayesian versus frequentist inference. In fact, as 
pointed out in Helland (1998b), the case with objective Haar prior is just the case 
where the two schools give similar results. 

The arguments of Hclland (1998a) were based on quantum logic, and gave there- 
fore only an indirect construction of the Hilbert space in question. Here we will 
give a direct construction using group representation theory. The requirement that 
a statistical model should be narrow, is made concrete by requiring that the rele- 
vant Hilbert space subspace should be irreducible under G. A definition of the state 
of a system is in general based upon what is known about some given parameter. 
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which we sometimes will call parametric function to distinguish it clearly from sin- 
gle parameter values. (The parameter may be multidimensional, i.e., the case with 
several onc-dimensional parameters is of course included). An operator correspond- 
ing to a parameter/ parametric function is found via a mathematical theorem, and 
this operator may then be used to derive the ordinary formula for probabilities and 
expectations. 

If the main idea here is accepted, there seems to be room for further developments 
both within statistics and within physics; some tentative ideas follow. 

For theoretical statistics: The present focus is very much on classes of probability 
measures where each class has a relatively small set of parameters. In many appli- 
cations this can be complemented by a rather rich and useful theoretical platform 
focused on a large parameter space where only a few specific parametric functions 
are of direct interest. Sometimes this same parameter space can then be used in 
probability models for several potential experiments, perhaps also for single models 
conditioned in different ways. Of course, this is a much more complicated concept 
than the standard one, but one can compensate by paying less attention to the de- 
tailed properties of the probability distribution chosen. Ideas not far from this are 
in fact also discussed in good books in experimental design, like Box, Hunter and 
Hunter (1978) and in books and papers in data analysis by various authors. 

The other specific feature of the models in this paper: a close link between 
probability models and symmetry groups, will be amply discussed below. Such a link 
was argued for by using purely statistical arguments in HcUand (1998b). Another 
potentially important relation to statistics from the topic of this paper seem to be to 
the theory of design of experiment; see the use of group theory in papers like Bailey 
(1991), Bailey and Rowley (1990), and quite recently, McCuUagh (2000). 

For theoretical physics: As many others who try to look at quantum theory 
from the outside, I first of all miss simplicity of fundamental ideas. Although this 
definitively constitutes the standard theory with a huge literature and ample exper- 
imental support, it still seems to me to be difficult to accept as the final solution 
a theory that takes something as abstract as rays in a Hilbert space as its basic 
state concept, and where the relativistic generalizations have even more abstract 
foundations. Renouncing this foundation when no satisfying alternative exists, is of 
course impossible, but there do exist books on quantum theory that have a relatively 
open attitude towards foundational issues (Isham, 1995; Peres, 1993, and references 
there), and radically new foundations have at times been proposed. 

As a contrast to the Hilbert space concept of state, the concept of symmetry, 
and hence of symmetry groups acting on some given space or set, is a very simple 
one, basically, but still a concept from which much interesting mathematics can be 
developed, and, as shown below, very much of the essential structure of quantum 
theory can be regained. Our aim is that one day a complete theoretical physical 
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platform shall emerge based on simple concepts like sets and symmetry groups, and 
that derived vector space concepts, for instance group representations, will be held 
for what they really are: extremely useful calculating tools, but - as shown in the 
examples below - not the only possible ones. 

Also, the statisticians' attitude towards mathematical modelling of real phenom- 
ena is today somewhat different from what we find in theoretical physics: In statis- 
tics, nearly all models are approximations, and they always depend upon unknown, 
hypothetical parameters, whose value we can never determine completely, only find 
uncertain estimates of. Physicists seem to be more used to thinking of an absolute, 
objective world behind their endeavours. In the long run, we may perhaps have 
something to learn from each other. 

Some physicists may feel that the discussion that follows is fairly abstract. I can 
only give two comments on this: First, the point of departure, models for statistical 
experiments, may be unfamiliar to physicists; however, for the present paper one 
does not really need much more from this than the fact that everything depends 
upon some space of unknown parameters. Secondly, the main mathematical work 
here is really to demonstrate that a version of the ordinary Hilbert space based 
quantum theory is indeed consistent with this framework. 

The plan of the paper is as follows: We introduce some simple concepts based on 
set- and group-theory in Section 2. In Section 3 an equivalence is established between 
parameters and certain subspaces of a linear 'wave-function' space, and in Section 
4 a kind of spectra of a parameter is defined from group-theoretical and statistical 
concepts. In Section 5 the basic concepts are illustrated on the example of a spin 1/2 
particle, and a short discussion of the EPR paradox is given. The relation between 
parameters (statistical theory) and 'wave functions' is made more concrete in Section 
6, while the necessary background on statistical experiments is given in Section 7 
and Section 8. Then Section 9 and Section 10 give a rather thorough discussion 
of the simplest type of experiments, those where the parameters of interest are 
permissible with respect to the underlying group. The quantum theoretical formulae 
for expectation and for change of state are established for this case, but should then 
probably be regarded as a complicated way of saying simple things. This discussion 
also leads to the rules for change of measure during a measurement. In Section 11 
we give the first discussion of experiments with non-permissible parameters, and also 
give some technical results needed for the final discussion. Finally, in Section 12 and 
Section 13 we give the main results on comparing the two approaches to modelling 
and predicting experimental results. The rest of the paper consists of examples and 
discussion. 

Related ideas, namely that symmetries should constitute the main foundation 
behind quantum theory, have recently been advocated in great detail from a physical 
point of view by Bohr and Ulfbeck (1995). The present paper aims at making such 
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ideas as precise as possible, which seems to make necessary a point of departure 
radically different from what is ordinary. However, by using various mathematical 
tools, most notably the representation theory of groups, the resulting formalism will 
in the end be very close to ordinary quantum theory. 

We have followed the statistical custom of denoting the state variables of a system 
by greek letters, called parameters. Alternatively, we could have denoted the system 
variables by more physical symbols like x,p,s etc.. One could then argue that it 
follows from the arguments of this paper that quantum theory is close to what 
one can find from classical physics under symmetry assuming that information is 
restricted and that one has the choice between several complementary experiments. 
We have four reasons for not choosing this last notation, however. First, we want to 
have the possibility of distinguishing between system parameters and experimental 
values. Secondly, it is a very important point of the present theory that models should 
be reduced as much as possible. Thirdly, in certain situations it seems necessary to 
include the aspect that the notion of 'state' may depend on what is known about 
a system. Finally, we want to indicate that quantum theory not only is close to 
classical physics under the stated conditions; it is close to any system under such 
conditions. The characteristics that are specific both to classical physics and to 
quantum mechanics, namely a Hamiltonian and an ensuing dynamical temporal 
development, are just the issues that we will not address in this paper. 

A summary of the theory may be found in Sections 12 and 13. The two examples 
in Section 5 and Section 14 may also serve to illustrate the theory. Here are, very 
briefly, some key concepts and results: Definition 1 gives the core definition of a 
permissible parameter; in Proposition 1 it is shown that this property can always be 
obtained by reducing the group. In Theorem 1 we see the first connection between 
space (quantum theory) and parameter (statistics); this is followed up in Theorem 2 
with single functions and fixed parameter values; in Proposition 3 we sec that Haar 
measure splits in a natural way when permissible parameters are observed. The 
Theorems Al, 4, 5, 6 and 7 and Corollary 5 are all variants of the same theme: 
Parameters can be associated in a unique way by operators. The basic parameter 
reduction is introduced in Definition 6: The state space should be irreducibly in- 
variant under the basic group. The spectrum is defined in an essentially simple way 
in Definition 7: The main thing is that the (hyper)parameter space is reduced in 
such a way that the state space is a parametric invariant space. This is shown to 
be equivalent to the ordinary definition of spectrum in Theorem 8. The results of 
Section 13 show that ordinary quantum theory is contained in the framework of the 
main body of this paper, and it is in fact equivalent with it if we make a few extra 
assumptions, most notably the reversibility of transition probabilities. 



5 



2 Basic framework. 



Let $ be any fixed set, thought of as containing the description of all features, 
known or unknown, of some isolated system, physical or otherwise. Let G be any 
fixed group of transformations of Such groups always exist, of course, if nothing 
else, it could be the set of all automorphisms on but in many cases we will think 
of a proper subgroup of that one, expressing some natural symmetry property of the 
system description. 

Knowing the value of 0i G $ will imply full knowledge of the system in all 
details. For reasonably complex systems it will be impossible to achieve such detailed 
knowledge; it is systems of this kind that are the objects of this paper. The most 
one then can hope for, is to get to know the value 6i of some function 6{-) on 
In statistical investigations this situation is quite common, but it is seldom made 
explicit. A statistical experiment is in general designed to get information about 
some specific (perhaps multidimensional) parameter 9, and in the ideal case we can 
conclude after the experiment (if it has maximal accuracy) that this parameter has 
some specific value 6i. Such ideal experiments will play a great role in this paper; in 
fact the quantum mechanical literature is full of ideal experiments of this kind. We 
will, however, also discuss briefly the modifications needed to include experimental 
error. 

Below we will frequently meet the situation where several such parametric func- 
tions can be defined, typically corresponding to difi'crcnt potential experiments on the 
same system. But the notion is wider; in statistical contexts it may also correspond 
to the same experiment conditioned in different ways, to different (non-orthogonal) 
factors in a factorial experiment, or to different choices of a set of contrasts in an 
analysis of variance experiment. A proper discussion of these issues belongs to the 
statistical literature, however. Perhaps it may be more appropriate to talk about 
different questions posed about a system in such cases than to talk about different 
experiments. As is well known from quantum mechanics, an experiment designed 
to determine the value of one parametric function ('variable') may make the deter- 
mination of another parameter difficult or impossible. In my opinion, such cases of 
complementarity often occur in macroscopic contexts, also. I will come back to this 
issue later. 

Now we will bind together in a new definition the two concepts discussed so far. 
Definition 1. 

The parameter 9{-) on <1> is said to be permissible with respect to G if we always 
have that 9{(pi) = 9{(p2) implies 9{g(pi) = 9{g(p2) for all g £ G. 
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The first implication of this concept is that a new group G can be introduced, 
a group of transformations g on the image space of the permissible parameter, 
where g is defined by 

g{em = o{g(t>)- 

(The simple proof is given both in Helland (1998a) and in Helland (1998b).) This 
may often imply that it is possible to change focus from the large space $ with the 
big group G to the smaller space with the simpler groTip G. Note that the mapping 
5 ^ 5 is a homomorphism: gi ^ gii 92 ^ h imply 5f ^ ^ ^ and gig2 ^ gih- 

Secondly, this notion turns out to have implications in statistical inference theory, 
as shown in Helland (1998b) (, where the same notion is called invariantly estimable, 
to conform to statistical usage) : Certain paradoxes in Bayesian inference are avoided 
when inference is confined to permissible parameters, best invariant estimators equal 
Bayesian estimators under Haar prior, and credibility sets (under Haar prior) and 
confidence sets coincide. Further results on Haar measures from the assumption of 
permissibility are derived in Section 9 below. We will also meet more permissibility 
results in the next section. 

Typically, the parametric functions that correspond to quantum mechanical vari- 
ables will not be permissible with respect to the basic group G. However, they can 
always be made permissible with respect to a smaller group. 



Proposition 1. 

Let Ga be the set of g & G such that for all 4'i,4>2 & ^ we have that 9a{4>i) = 
Ga{4'2) if o,nd only if 0aig<Pi) = Ga{94'2)- Then Ga is a subgroup of G, and the 
maximal one such that da{') is permissible with respect to Ga- 



Proof. 

Ga contains the identity. Using the definition with (/>!, (j)2 replaced by g2<t>\i 924^2, 
it follows that gig2 € Ga when gi € Ga and 52 € Ga- Using the definition with 
(t>\,(f>2 replaced by g~^(j)i, g~^(l)2, it is clear that it includes inverses. It follows from 
the construction that it is maximal. 

It will be useful later to consider the ordering of parametric functions: 
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Definition 2. 

Wesaythate^^\-) ^ 0(2) (.) iff = ^(0i'^)[(j))) for some i;. Ife^^\-) ^ 9(^\-) 

and 9^'^\-) :< 0^^\-) we say that the two parametric functions are equivalent. 



Note that the basic hyperparametric set $ might as weU be replaced by any 
equivalent set. Note also that this set will be maximal under the ordering ^. 

The following definitions are not all strictly necessary for the theory that follows, 
but they might serve to focus upon the situation under consideration. 



Definition 3. 

A frame is defined as any collection of parametric functions {9a{-);a G A} such 
that the mapping (f) — > {9a{(f));a G A) can be inverted. 



Since it will typically not be possible to perform (at the same time) experiments 
corresponding to all a € it does not follow from this that the hyperparameter (p 
can be recovered from experiments. 

Typically, in a frame, all (or some) 9a{-) arc nonpermissible with respect to G. 
Thus we need to know how to tackle such functions. We choose to introduce the 
weaker concept of consistency. 

Definition 4. 

(a) The frame {9a{-);a G A} is consistent if each 9a{-) is permissible with 
respect to a group Ga such that G is contained in the smallest group containing all 

Ga. 

(b ) The hyperparameter space $ is consistent if it can be connected to an consis- 
tent frame. 

In this paper, all frames and hyperparameter spaces under consideration will be 
assumed to be consistent. Furthermore, it will be assumed that the subgroups Ga 
have been chosen in a maximal way, which they can. 
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Definition 5. 

Given a frame {9a{-)',a S A}, a minimal hyperparameterspace is defined as a set 
^ of parameters ip = permissible with respect to G, such that each 9a will be a 

permissible function ofip, and such that V'(') is minimal with respect to this property 
under the ordering 

Definition 5 aims at reducing the parameter space as much as is possible. If 
^ 7^ it is completely certain that the points € $ are impossible to recover 
from experiments; the most one can hope for, is information on = ip{4>). So in 
this meaning, only the space ^ makes sense, not The function spaces constructed 
later can be carried out for ^ as well. When experiments are performed, the minimal 
hyperparameter space may change in general. 

The essence of all these definitions is that they make precise - in a setting with 
group symmetry and a set of parametric models - the quantum mechanical concept of 
complementarity. Different parametric functions may correspond to complementary 
aspects of reality. To get full information, we need to estimate all these parameters, 
which may be impossible with the same experimental units. As stressed already by 
Niels Bohr, this concept may be useful also in daily life and in a variety of sciences. 
Today, the concept is used for instance in psychology and in the social sciences, but 
its use in other natural sciences than physics is rather limited. 



3 Function spaces, invariance and irreducibility. 

Let Tl be any linear space of real or complex functions on This may be thought 
of as the space of 'wavefunctions' in quantum mechanics. Later we will assume that 
the group G introduces an invariant measure (Haar measuere) on and use as TC 
the Hilbert space L^($, v) or a proper invariant subspace of this space, but no such 
special structure is needed for this section. It will be assumed, however, that 7i is 
closed under the regular representation Ur{G) = {Ur^q)} of G on H, defined by 

URig)f{cP) = f{g-'(l)). (1) 

In the L^($, i/)-case this will be automatic, and the operators Uji{g) can be checked 
to be unitary. In general, the mapping g Uii{g) will be a homomorphism from G 
to the space of linear operators on 7i. Any subspace V oiTi which is closed under 
the operators Uji{g) defined by (||), is called an invariant subspace. 

A new class of permissibility-results will be of direct importance for the present 
paper: In a straightforward way an order-preserving link can be established between 
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the permissible parametric functions on $ and the invariant subspaces of H under 
the regular group-representation corresponding to the group G. As mentioned in the 
previous section, many parameters of quantum mechanics will be non-permissible, 
but in that case the result below can be used with the group G replaced by its 
subgroup Ga, defined in Proposition 1. The significance of this result lies in the 
fact that correspondance is established between parametric functions (a concept of 
statistics) and linear spaces (a concept of quantum mechanics). Later, we will also 
introduce a correspondance between specific parameter values and single functions 
in H. 

Before proceding it may be useful to look at the summary of group representation 
theory given in Appendix 1. As is well known, this theory is very useful for doing 
calculations in applied quantum mechanics, say in elementary particle theory. Here 
we will use the representation theory to study the foundation of quantum mechanics, 
much in the same way as it was used in Bohr and Ulfbeck (1995). 

The first of the following results were also proved in Holland (1998a). In general 
an invariant subspace F of 7^ is called irreducible if it does not contain any nontrivial 
invariant subspace. 

Theorem 1. 

(a) For a given permissible parametric function 9{-), the set V = {f Ti. : 
/(0) = f{9{(p)) for some /} is a sub-space which is invariant under the regular 
representation Ur. In the L^-case the space will also he closed. The space V depends 
only on the equivalence class of parametric functions 9{-). The invariant subspaces 
of this form will in the following he called parametric invariant subspaces ofTi. 

(b ) Every closed, invariant subspace V of a closed linear space Ti is contained in 
a parametric invariant space with 9{-) = {ui{ );i = 1,...), a basis for V . 

(c) Ordering of the parametric invariant subspaces under inclusion correspond to 
ordering of parametric functions under ^. The space V is irreducible within the class 
of parametric invariant spaces if and only if the corresponding permissible function 
9{-) is minimal with respect to the ordering :<. Nonequivalent minimal permissible 
functions correspond to irreducible parametric spaces with trivial intersection. 



Remark 1. 

In statistical modelling it is crucial to use a parametrization such that 9 is as small 
as possible with respect to the ordering ^ without loosing information, otherwise 
redundancy problems such as collinearity will occur. In practice, this also relates to 

problems such as near collinearity, since every statistical model is an approximation. 
According to Theorem 1, this is equivalent to having the invariant space V as small 
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as possible. This point of view will be very useful later, where it will be used for 
arguing that the basic Hilbert space should be chosen small in this sense. 



Remark 2. 

It may seem that V is typically trivially embedded into a very large parametric 
invariant space in (b). However, look at the case where V is irreducibly invariant 
of dimension d. Then, by using the Clebsch-Gordan series for polynomial functions, 
and taking limits in general, one can show that the embedding space can be linearly 
spanned by the matrix elements of the representing matrix, thus has dimension at 
most (f. 

Proof. 

(a) Let e{-) be permissible and let V = {f e H : f{(f>) = f{9{<p))} for some /. It 
is clear that this set of functions is closed under linear combinations. In the L^-case 
it is also closed under infinite sums that converge in L^-norm, implying that it is a 
closed space. If / G F, then 

so Uptig)/ € V. Hence the space is invariant. 

It follows directly from the definitions that 9{-) ^ iff tbe corresponding 

subspaces satisfy V CVi. In particular, equivalence of parameters implies equality 
of subspaces. 

(b) Let {uj(-); f = 1, 2, . . .} be a fixed orthonormal basis for V. 

For fixed (p, it is convenient to regard u{(p) = {ui{4i); i = 1, 2, . . .) as a vector in 
P. For each i and g we have that Ui{gcf>) = UR{g'^)ui{(j)) as a function of (?!) is a 
vector in V , and can therefore be expressed in terms of basis functions: Ui{g(l)) = 
Ylj Uij{g)uj{(l)), or u{g(p) = UR{g)[u{(j))] for some operator Uji{g) on P. From this 
last representation it follows that the vector function 9{(p) = (ui{(p);i = 1, 2, . . .) is 
permissible. 

Since /(^) = a^u{(f)), it follows that all vectors f EV are functions of u{-), which 
is permissible. Thus V is contained in the invariant space corresponding to 9{-). 

(c) The first two statements follow directly from the definitions. Assume that two 
non-equivalent minimal permissible parameters 9i and 92 have spaces with nontrivial 
intersection Vq. Then Vq must be an invariant space in the ordinary sense, and can 
without loss of generality be assumed to be irreducible (again in the ordinary sense). 
Let it have basis uq{-), and let the space corresponding to ^j(-) (i = 1, 2) have basis 
{uo{-),Ui{-)). Then by (b) we have 9i{-) = ipi{uQ{-),Ui{-)) for some function V'i- By 
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permissibility Ui{g4>) = gi{ui{4))) for group elements in the three groups Go, Gi 
and G2. Fix and take = V'j('"o(0)! "t(?^o))- Then one can verify from the 

definition that these are permissible, and this contradicts the assumption that ^i(-) 
and 6-2{ ) are minimal. 

Corollary 1. 

IfVi (i = 1, . . . , fc) are parametric invariant subspaces ofH corresponding to the 

permissible parameters 9i {i = l,...,k), then V = f\Vi is a parametric invariant 
subspace if and only if the minimum of {9i\i = 1, . . . , A;} with respect to the order- 
ing :<, is a permissible parameter, and in the latter case this permissible parameter 
generates V . 

Corollary 2. 

Assume fo{<j)) = f{9{(p)) for some permissible parameter 0. Assume that Ti is 
closed. Then the closure of {Uii{g)fo; g € G} is an invariant set in H contained in 
and spanning an invariant subspace of the space V corresponding to 9{-). 



4 Minimal model and spectrum. 

As already remarked, we will later regard the functions / as 'wave functions' in 
quantum mechanical models. However, already now we will, in agreement with 
Remark 1 above, introduce the following definition. 



Definition 6. 

A quantum mechanical model is called minimal under G if it is required that all 
functions f under consideration belong to some fixed G-invariant and G -irreducible 
space. We will from now on regard this fixed irreducible space as our basic space and 
denote it by Ai. 

In the case where Ti. is an L^-space, we can and will always take as a closed 
subspace. Everything that is said about TC above, also apply to Ai. 

Note that it is not required that the space M. should be a parametric invariant 
space; typically it will not be so. Hence we can not use directly the argument that 
a minimal model is narrow in a more conventional statistical sense. However, we 
shall have in mind the situation where we have several - typically non-permissible - 
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parametric functions {9a;a G A}, and hence several subgroups Ga such that in each 
case 6a is permissible with respect to Ga- Then we shall sec below that it is possible 
to make each 9a correspond to a parametric subspacc Ad"" closely related to A4, 
namely by restricting the wave functions to a certain subset of <I>, the prespectrum 
of 9a- Hence in this respect the model will be minimal also in the more conventional 
statistical sense. 

First it is clear that there will always be a linear subspace Va of Ti. consisting of 
functions of the form /(^) = f{9a{4>))- We will assume that this subspace is so large 
(under the ordering ^ defined above) that Va ^ M. It can be seen later that this 
corresponds to the wellknown quantum- mechanical concept of a maximal commuting 
set of operators. Since we in agreement with Definition 6 are only interested in 
functions in M, we will then in effect replace Va hy Ai, but then looked upon as a 
set of functions f{9a{-)), where typically the /'s are restricted to a proper subspace 
Qa of those determining Va- 

We will need a general, simple result on restricting a transformation group to a 
subset. 

Lemma 1. 

Let G be a transformation group on a set $ and let C. ^- Let Gq be the set 

of all g £ G such that g(j){) G $o o-n-d g~^4'Q € $o whenever <j)Q € <I>o- Then Gq is a 
subgroup of G and the maximal such which is a transformation group on $o • 



Proof. 

This is easily verified. 

One characteristic of quantum mechanics is that a variable connected to a bounded 
operator will have a discrete spectrum of possible values. In our treatment these 
variables correspond to parametric functions. It is possible that the following con- 
siderations in some way may be developed further to give an operator-free definition 
of the spectrum associated with a parameter. 



Definition 7. 

(a) Let in general f be a function defined on a set and let $o be a subset of^- 
The function f restricted to $o is defined as equal to the equivalence class of functions 
/o such that fo{(p) = f{(f)) for (/> G $o- ^ representing function corresponding to f is 
any member of this equivalence class. 
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(b) An 0-prespectrum $° of the parametric function 9a{-) is defined as a maximal 

subset of such that for all /(•) € Va, when restricted to ^>[J there is a representating 
function /o which belongs to Ai. An 0-spectrum 0° of 6a{-) is defined as ^a(#^) for 
some 0-spectrum 

(c) Assume Ti = L^{^, f) with v a probability measure (; for instance, $ could be 
compact). Then the prespectrum of 9a{-) is defined as a 0-spectrum which min- 
imizes l]i Var(ry(0)|0(j(-) = ^a(0i)) for any square integrable r], where the variance 
is with respect to v and the sum is over the 0-spectrum. The spectrum is defined as 

The essence of this is that the space M, when restricted to a 0-prespectrum is a 
parametric invariant subspace of the corresponding Hilbert space. The spectrum is 
just the 0-spectrum which minimizes unnecessary variance. 

Straightforward considerations show that the prespectrum and spectrum, when 
defined in this way, exist. A crucial point is the fohowing: If /j is a restriction of 
some / from ^ to some <I>j {i = 1, 2) and $i and $2 are assumed disjoint, then /i + /2 
is a restriction of / to $1 U $2- However, the 0-spectrum is not unique in general, 
as shown in the next section. Later we will associate a unique operator Aa to a 
parametric function, and then we will show that the spectrum of Oa{-) is unique (the 
prespectrum may be non- unique), is independent of rj and is equal to the spectrum 
in the ordinary sense of the operator Aa corresponding to Oa{-). 

In effect then, the spectrum of a parametric function can be defined from pure 
group-theoretical/ statistical considerations, without involving any operator. This 
seems to really open up the possibility of having a foundation of quantum theory 
only based upon set- and symmetry-concepts. When it comes to calculations, the 
operator will be useful, however. 

Some simple results follow. 

Proposition 2. 

(a) If (Pi e $° and ga G Ga, then gaCpi G If Oi G 9° and ga ^ Ga = OaiGa), 
then (jaOi e 6°. Thus each O-spectrum must consist of an orbit or a set of orbits of 

Ga. 

(b) The 0-spectra and in particular the spectrum of 9a{-) are the same when the 
hyperparameterspace is changed as in Definition 5. 



Proof. 

(a) The space A4 is invariant under /(•) fig-), in particular so for g = ga & Ga- 
Hence the definition of spectrum is invariant under (pi — > ga(pi- 
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(b) Easily verified. 
Corollary 3. 

Let Ai"" be the space of functions in A4 restricted to a prespectrum of 6a{-), and 
define Ti"^ correspondingly. Then M."' is a Ga-invariant parametric subspace of 71°'. 
Furthermore, $2 is a maximal subset of $ with this property. 



5 Spin 1/2 particles and the Einstein-Podolsky- Rosen 
paradox. 

As an example to illustrate the general theory of this paper, we give a nonstandard 
description of a spin 1/2 particle, a description which will turn out to be essentially 
equivalent to the one given by ordinary quantum theory. 

Look first at the general angular momentum case. A hyperparameter corre- 
sponding to this angular momentum may be defined as a vector in three dimensional 
space; the direction of the vector giving the spin axis, the norm the spinning speed. 
A natural associated group G will then be the group of rotations of this vector in 
TZ^ together with changes of the norm of the vector. With respect to this group 
one has two permissible functions tp — ||^|| and ip tp/\\tp\\. We consider the sit- 
uation where parametric reduction by means of the last function has already been 
performed, so we will from now on look upon ip as a unit vector, and the group G 
as the group of rotations of such a vector. 

We may want to have a hyperparameter space on which G is transitive; then we 
extend it so that € $ can descibe the orientation of any fixed three-dimensional 
object, for instance described by the three Euler angles. Then let ip he a fixed unit 
vector attached to this body. The nature of the solid body and the choice of ijj in 
it, do not matter. All these quantities - including and ip - are at the outset model 
quantities, and hence unknown. 

The most that we can hope to be able to measure, however, is the angular 
momentum component 0a(0) = I IV' 1 1 cos (a) in some direction given by a unit vector 
a, where a is the angle between ip and a. This measurement is a standard one 
using a Stern Gerlach device oriented in that direction. To be more precise, this 
device measures some quantity x whose distribution depends upon 9a, so there is 
some possibility that some part of the parameter 9a can be estimated from such a 
measurement. The rest of the (hyper-) parameter (p is obviously totally unavailable. 

The function Oa is easily seen to be non-permissible for fixed a. The natural way 
to make it permissible, is to restrict ourself to a subgroup Ga of G, which we always 



15 



may do. In the present case, the maximal possible choice of Ga is the group of 
rotations of the unit vector around the axis a together with a reflection in the plane 
perpendicular to a. (This is the group that takes the sets of the form Oa = const, 
into themselves.) 

This group has several orbits: For each k G (0, 1], one orbit is given by = fc 
and 9a = —A;; for A; = 1 these are degenerated. In addition there is an orbit for /c = 0. 

The obvious problem with this setup from a statistical point of view is the fol- 
lowing: The Stern Gerlach measurement can give only one of two values, say x = —1 
or X = +1. How can we hope to estimate a continuous parameter 6a G [—1, 1] from 
that little information? The solution offered here is outside current statistical theory, 
but seems sound from an intuitive point of view: Restrict the values of 9a for which 
inference shall be made to its spectrum as defined above. 

To this end, take first the Hilbert space H = L'^{ct),v), where v is the Haar 
measure of G on The group representation theory of the rotation group G is 
well known and described for instance in Hamermesh (1962). Wc will not go into 
details here, but choose one irreducible representation, the two-dimensional one cor- 
responding to a spin 1/2 particle. (We may, and will, disregard the fact that this 
representation is double- valued.) 

To begin with we may regard the irreducible invariant space A4 as some fixed 
abstract two-dimensional space, say with basis vectors (1,0)' and (0, 1)', and where 
the representation group U{-) is the unitary group SU{2) consisting of all complex 
unitary 2x2 matrices with unit determinant. One way to write these is 

U (g) = e~^'^9 with ag = gia^ + g2ay + g^a^, 

and where ax, Oy and are the Pauli spin matrices 

""^={1 J ) ' = ( i r ) ' = ( -1 ) • 

The space M can next be taken as a subspace ofTC = L^{^, f) by replacing the 
basis vectors (1,0)' and (0, 1)' with two fixed normalized functions /i((/<) and f2i<P)- 
It is convenient to take these as orthogonal in the L^-norm. The basic requirement 
is that span(/i, /2) shall be invariant under gcj). 

In fact we will not need the details of this to find the 0-spectrum of the parameter 
9ai-)- It is enough to know that A4 is two-dimensional: First note that the parameter 
space Qa is [—1, 1], and the corresponding group Ga is just the set of reflections in 
0. From Proposition 2(a), the spectrum must consist of sets of orbits of Ga, which 
are the pairs ±k for k G (0, 1] and A; = 0, which of course also can be seen from the 
description of the orbits of Ga given above. (Incidently, this part of the reasoning 
also holds for higher spin and angular momentum.) 
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In fact, for this example one may verify that any orbit pair it A; for < < 1 
gives a 0-spectrum. Using the spin-operator in the usual way we verify that the 
spectrum corresponds to A; = 1. In this case it is also obvious from the situation that 
this is the 0-spectrum which minimizes unnecessary variance. 

To make precise: The value of 9a before the measurement is a continuous param- 
eter as defined geometrically above. The values after measurement are restricted to 
take one of two possible values -1 and +1; let us call this O^. For simplicity we con- 
sider an ideal measurement x, after which we then can take 9\ = x. The probability 
distribution of the outcome x of the Stern Gerlach experiment turns out to depend 
only upon 9a as expected , and is given by 

= -1) = ^(1 - 9a), P(X = +1) = 1(1 + 9a). (2) 

The standard quantum theoretical derivation of these formulae may be found in 
several textbooks; the most important part needed is given in Theorem 9 below. An 
alternative motivation can be found from a symmetry argument: Looking at a simple 
figure, we see that the two probabilities are proportional to the Lebesgue measure of 
the parts of the diameter along a determined by the division point corresponding to 
9a- It would be interesting if this observation could be used to give an operator-free 
derivation of @. 

Note that it follows from equation ^ that E(x|0) = \9a-, a result that will be 
needed below. 

If ip is unknown, we get from (|2|) that the unconditional distribution is simply 
given by 

P(x = -1) = P(x = +1) = i. 

This is true whether we use a prior for ip from G, from Ga or from its restriction to 
the spectrum. When discussing the EPR problem below, however, it does have an 
impact which of these groups we use to define a prior. The correct solution is then 
to refer to the measurement model that was actually used, hence use the refiection 
group as restricted to the spectrum ±1, hence probability 1/2 on each of these two 
values. 

Consider then the Einstein-Podolsky-Rosen situation (as modified by D. Bohm), 
where two particles previously have been together in a spin state, so that they - in 
our notation - later have opposite spin-vectors ip and This situation has been 
very much discussed in the literature, where the focus often has been on the fact 
that one here gets correlated spin component measurements even at great distance. 

This in itself is not really surprising, however, since the two particles have a 
common history. Correlation through a common history can be found in many 
instances also in large-scale statistics. A much more serious difficulty is that Bell's 
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inequalities preclude explanation of the correlations via ordinary hidden variable 
models. 

But even this difficulty is solved by letting the prior depend on the choice of 
measurement as described above. Let an observer A measure the spin of one particle 
along a direction a, and at some distance from this, let another observer B measure 
the spin of the other particle along some direction b. Let u be the angle between the 
directions a and b. 

Let us first describe the situation from the point of view of the observer A. The 
point is that his measurement will give some information on the previous value of 
ip, which in principle can be recovered using a Bayesian argument. We will not 
need this argument, however; it is enough that the two measurements are correlated 
through a common ip. This common is not a physical (hidden) variable, though; 
it is a hidden parameter, and as such it has only meaning when coupled to the 
measurement model of one or the other observer. 

As explained above, the observer A will have a prior on ip given by a probability 
1/2 on 9a = —1 and a probability 1/2 on 9a = +1, where 6a is the cosinus of 
the angle between a and (p. This is equivalent to a prior on ip with probability 
1/2 on each of a and —a. Let b-^ be a vector in the plane determined by a and 
b, perpendicular to b. Then, from the point of view of A, ip will have a prior 
concentrated on a = bcosu + b^smu and —a, so that the prior component of —ip 
along b will be — ^acos(u), again taking two opposite values with equal probability. As 
before, E(xa) = E(xb) = 0, where Xa and are the two measured spin components, 
and we find 

E(ar„Xb) = E(E(x„Xb|V')) = E(0„ • (-0„cos(n))) = -cos(tx), (3) 

so that the correlation between the spin component measurements is — cos(?/), the 
same answer as given by ordinary quantum theory calculations. 

The above calculation was made from the point of view of A, using his prior. 
It is easily seen that the same answer is found from the point of view oi B. It is 
crucial that the argument requires no action at a distance, contrary to what seems to 
be claimed even in recent papers discussing quantum entanglement (e.g., Buchanan, 
1998). 

A rather naive analogy could perhaps explain what in my view is going on here: 
Two persons A and B have grown up together and gone to school together, but later 
they loose contact. At some point they are then picked out for a certain psychological 
test together with some other people, and the results for A and B turn out to be 
correlated. This can of course be explained roughly by pointing at the common 
background, but pressed to go more into details, A gives his explanation and B gives 
a slightly different one. One can in fact easily imagine that the issue is so complicated 
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that there exists no detailed 'objective' explanation: only two complementary views 
exist. In the same way, the correlation in the EPR experiment can not be explained 
by some 'objective' hidden variable, but the two complemantary priors are both 
consistent with the observed fact. 

Related conclusions on the EPR-experiment were recently reached by Deutsch 
and Hayden (1999) using the Heisenberg picture to analyse quantum information. It 
may be of some interest to compare briefly these two approaches. Deutsch and Hay- 
den make extensive use of the cnot operation which is a logical operation connected 
to the spin measurements. In particular, each of the two spin component measure- 
ments are associated with a cnot operator, which can be interpreted as described 
above. A crucial point, however, is that a third cnot operator is needed after the 
two measurements. This may be linked to a comparison of the result found by A 
and the result found by B, and is thus directly connected to the correlation found. 
From a statistical point of view, the usual model for a measurement is an interaction 
between two players: nature and the observer. In the present case it may to some 
extent be fruitful to introduce a related concept to describe the interplay between 
two observers with different priors. At least this may seem to be a way to make the 
argument of Deutsch and Hayden (1999) less formal, and thus making it reasonably 
close to the argument sketched above. 



6 Hyperstates and hyper parameters. 

While states of a system in statistical modelling are most naturally given by points 
01 in some parameter space in quantum theory they are given as rays in some 
Hilbcrt space. We will choose here to represent the Hilbert space by 7i = L^{^, f). 
The question is: Can one associate in a natural way a unique ray represented by a 
vector f^^ G 7i to every £ ^7 A simple informal solution might be to let /^^ 
be S{(f) — 0i), but this is an improper function, and does certainly not belong to H. 
A much better solution is to make use of the group structure associated with the 
Hilbert space. We will assume here that the group G is exact and transitive, so that 
for any pair ipi) there is exactly one g £ G such that 4>i = gcpQ. A more general 
discussion of the same concepts may be found in Perelomov (1986). A brief comment 
to the general situation will be given at the end of the Section, and for derived groups 
situations with non-transitivity will be common later. The main results here are: 
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Theorem 2. 

Assume that the group G is exact and transitive on a parameter space and let 
{Unig)} be its unitary regular representation onTt = L'^{^,u). 

(a) Fix foGTi and (po € ^. There is then a homomorphy from the parameters 
E ^ ( with group elements g) to the set F of functions f^^ ( with group elements 

Unig)) which is given by f^^^icj)) = /ol^rV) for (j)! = gifpo- 

The set F is invariant under {UFi,{g)}, and F is contained in and spanning the 
least invariant subspace of TC containing /o . The functions f^^ are unit vectors in 
Ti. This homomorhy is an isomorphy if we assume that no nontrivial f E: H is a 
strictly periodic function. 

(b) Let V\ be an irreducible invariant subspace of Tl of dimension larger than 
1 such that Vi n F is non-trivial. Assume that no nontrivial subgroup of G has a 
representation of dimension 1 in Vi. Then the vectors in Vi D F lie in different 
one- dimensional subspaces, and these span Vi. 

(c) There is a larger space $' I) $ such that every one- dimensional subspace V\ 
of TC is a pure state in the sense that there is a (p' & that is mapped upon V\ in 
the way described in (a). The group G can always he extended to a group G' such 
that $ and \ <I> are different (unions of) orbits for G' . For any such extension, 
and corresponding extension of Haar measure v to v' , we have that 7i, extended by 
putting functions equal to on ^' \ is an invariant subspace ofH' = L^(<&', u'). 



Proof. 

(a) We have that 

^«(5)/^i(<A) = /o(5rW)= 

/o((55i)-V) = /,<^x(0). ^ ' 

Assume that this homomorphy is not an isomorphy, so that f/i?(5i)/o = C^R(fl'2)/o 
for some gi / 52- Then UR{g)fo = fo for g = gig2^, hence /o(9~V) = foi<P) for all 
(j). This is only possible if fo is strictly periodic. The rest is easy to prove. (Compare 
also Corollary 2.) 

(b) The functions constructed above arc of the form = fo{gi^<p) if ^1 = 
gicpQ. Assume that two of these, say /<^^ and in the same one-dimensional 
subspace. Then, by the linear independence of the basis vectors we get UR{g2)fo = 
cUR{gi)fo for some scalar 07^!, thus f/i?(5jf ^5'2)/o = c/o- By taking norms, we find 
|c| = 1. Assume that c is a complex number different from 1. This implies that the 
cyclic group generated hy g = g2 has a onedimensional irreducible representation 
on the vector-space spanned by /o, which leads to a contradiction. 

We note that as gi varies, UR{gi)fo = fg^^^a span Vi, since if it spanned a smaller 
space Vq, this would be invariant under the representation. 
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(c) Take \ $ as the collection of unit vectors /(•) not contained in F, and let 
G' be any group on this set. 

These results are essential for what wc want to show in this paper: It is possible 
to establish a relationship between the concepts of theoretical statistics, based on 
unknown parameters in some parameter space, and the concepts of quantum theory, 
based on state vectors/ rays in a Hilbert space. When it comes to interpretation, it 
seems that the parameter concept is easier to understand in most cases, even though 
physicists over the time have developed much intuition related to the Hilbert space 
paradigm. 

The construction of this Section is closely related to the theory of coherent states, 
which go back to Schrodinger and von Neumann, and which has been developed and 
applied in great generality recently, see Perelomov (1986). The simplest application 
is to the Heisenberg-Weyl group related to the coordinate and moment operators. 
Another relatively simple application is to the rotation group of three-dimensional 
space; a special case of this has already been discussed in Section 5. 

In the above discussion we have for simplicity assumed that the basic group G is 
exact. In general, non-exact groups can be handled by defining the isotropy group 
Go as the set of 5's such that /o(5^) = /o(^) (modulo a phase factor), and then 
replacing G by G\Gq in the definitions above. Typically, this option will be needed 
when we go to concrete applications. A related procedure may be used when we 
come to non-permissible parametric functions. 



7 General statistical background. 

The ordinary statistical framework for modelling experiments will be used: A sam- 
ple space X listing the possible outcomes of the experiment, a a-algcbra (Boolean 
algebra) of subsets of X, giving the events to which probabilities can be assigned 
and a class {P^} of probability measures on {X,J^). In statistics the parameter 9 is 
usually thought of as the unknown quantity in the sitTiation. The statistical concept 
of state of the system will then either be a fixed value of ^ or a probability measure 
over the parameter space Q. 

Consistent with the above discussion, we add the following feature, which may 
be of value both in statistical theory and in quantum theory: In a given situation 
we may have the choice between several potential experiments, sometimes mutually 
exclusive. Let a & A index the possible experiments. Then the basic mathematical 
construct will be {{Xa,J-'a,{Pa'"i^a G @a});a G A}. Though some may think that 
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this is a more elaborate construct than the ordinary Hilbert space basis of quantum 
theory, it has the advantage that each element has a concrete interpretation, and 
that it is essentially the same framework that is used in the analysis of macroscopic 
experiments in a large range of empirical disciplines. 

As before, we assume that each 9^ is a function of some hyperparameter cf), 
varying over a space $. (in fact, by taking $ large enough, this is no essential 
restriction) and make symmetry assumptions: Let G be a fixed symmetry group on 

It is assumed that the space $ is closed under the transformations in G. Often 
the structure of the parameter space is such that there is a very natural choice of 
G: If $ consists of only location and scale parameters, say, then the corresponding 
location and scale groups are used; if direction parameters are added, then this may 
make it natural to adjoin rotation group(s) and so on. 

A group acting on some space $ is said to be exact if there is at most one group 
element g transforming any given point <j)i to any other point (/>2. The group is 
transitive if there always is at least one such g. Nontransitiveness implies several 
orbits in the space this corresponds to superselection rules, which can be tackled 
by taking each orbit separately. (This must not be confused with the nontransivity of 
the subgroup Ga which was encountered in the previous sections, and will be further 
discussed later.) Non-exactness in an ordinary statistical setting was discussed in 
Helland (1998b). 

Assume that the hyperparameterspace $ is locally compact, has a right Haar 
measure v with respect to G and satisfies weak requirements such that the space 
7i = (<&,!/) of complex square intcgrable functions on <I> is a separable Hilbert 
space. This will be the Hilbert space on which we will start our investigation of 
quantum theory. One should have in mind, however, that all separable Hilbert 
spaces are equivalent, so it is possible to think of a general, abstract Hilbert space. 
Later we will switch our attention to a fixed irreducible invariant subspace of H.. 



8 Experiments. 

Consider first the traditional statistical setting where one concentrates on one sin- 
gle experiment. Let x be some random variable of this experiment, and let G* be 
the group on the sample space corresponding to G on the parameter space, so that 
'Ei'^{h{x)) = 'Ej^'^ {h{g* x)) for any function h. This gives a natural homomorphy from 
G* to G. Most of the physical literature concentrates on ideal experiments where one 
disregards experimental uncertainty, and thus the result x of the experiment defines 
new values of the parameter, a new state of the system. For such systems we can 
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disregard measurement values x and concentrate on parameters and hyperparame- 
ters. An important special case is when the experiment just consists of measuring a 
parameter 6{<j)). For the most part we can think of such highly informative exper- 
iments in this paper. To put things into a proper setting, however, we will briefly 
sketch some points from standard statistical theory, concentrating on the situation 
with an underlying symmetry group. 

One very common purpose of an experiment is to estimate the unknown parame- 
ter value 9{cj)), say by some function of the data 6{x). There is a considerable theory 
on how to choose 6 in some optimal way, see Lehmann (1983). The more formal 
approach to this problem is to define a loss function l{9{x);9{(l))), which gives the 
loss - expressed in some way depending upon the situation - when the estimator 9 is 
given. The choice of 9 is made by minimizing /. 

It was shown in Helland (1998b) that this problem was considerably simplified 
when 0((/>) was a permissible parameter, in particular certain classical paradoxes were 
avoided. Then it is also natural to concentrate on permissible estimators, satisfying 
the requirement that 9{xi) = 9{x2) implies 9{g*xi) = 9{g*X2) for all g* . 

Let I be a loss function satisfying the invariance requirement 

mx);9{<P)) = mg*x);9{g^)) 

for all g* . (This is equivalent to l{ga,gb) = l{a,b), where g is the group element 
induced by g on the image space of the permissible functions 9{-) and 9{-).) Then, 
as in Helland (1998b), Corollary 9, we have that the expected loss is independent of 
tehparameter (p and given by 

E^mxy,9m) = J i{m;9mp^{x)v{d<p), (5) 

where v is Haar measure, and the probability density p'^{x)is normalized so that 
/ p'^{x)v{d(j)) = 1. The righthand side of (^) is independent of x. If the data group 
and/or the parameter group is nontransitive (superselection variables), the equation 
is taken conditionally on each orbit. 

The consequence of this identity is that one can use standard tools to minimize 
the righthand side of (|5|). If / =|| 9{x) — 9{<i)) |p the solution is simply 

e{x) = j 9{(j))p'^{x)u{d^) = J 9q\x)9{d9), (6) 

where p'^{x) = q^^'f'^x) and u is Haar measure on the image space of 9. This is thus 
the minimum loss solution, and it is also the socalled Bayes solution under Haar 
prior. 

In other cases, one is interested in conclusions to the effect that the unknown 
parameter 9{(j)) belongs to a certain interval. When 9{-) is permissible, the two basic 
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such interval types, confidence intervals and credibility intervals (again under Haar 
prior) coincide. 

We will from now on assume that the experiment performed is ideal, in the sense 
that we arrive at a conclusion of the form 9 {(f)) G C, or rather, even more ideally, 
0{(p) = 6i and that we consider this conclusion as certain. We may also consider 
statements of this form when they are not results of experiments, say, hypothetical 
statements in a discussion, or conclusions arrived at from other types of information. 

9 An ideal experiment: measuring a permissible param- 
eter. 

In general, let 6{-) be a permissible parametric function on a (hyper)parameter space 
subject to a group G. We may to begin with consider the specific values (pi £ ^ as 
the pure (hyper-)states of the given system; a general hyperstate is then a measure 
over When there is no information about the system, a natural choice of measure 
is the Haar measure, in other cases the Haar measure over an orbit or over a set of 
orbits. Note that the Haar measure over an orbit is unique (up to a multiplicative 
constant); when several orbits are involved, each can be given an arbitrary weight. 

By what has been discussed in Section 7, all this can be translated in a unique 
way to a set of rays (one-dimensional subspaces) in an invariant subspace of Ti. = 
L^($,j^). The hyperparameter (pi corresponds to the ray generated by /0i(-) = 
foidi^') with (pi = gi(po. The group elements g on $ correspond to the unitary 
representation operators Uji{g) on H by equation in Section 7. The set of points 
in an orbit of G in $ correspond to the set of one-dimensional subspaces represented 
by {UR{g)f(f)^{-);g G G}. The situation we will have in mind in this paper, is that 
(pi is impossible to estimate directly; only certain functions 9 {(pi) are available. 

Let now 6{-) describe the parameter of some particular experiment. Then, by 
Theorem 1, 6{-) determines in a unique way a (parametric) invariant subspace V 
of 7{. In the same way as in Theorem 2(a), the different values of 6i = 0{(pi) can 
be associated with different functions of 9 that are unit vectors of V when V is 
considered as a Hilbert space on its own right. Note that the different Hilbert spaces 
of this kind are consistent with the original Tl, since Haar measure v on ^ induces 
a new Haar measure D on the image of the permissible parameter 9{-) in such a way 
that 



We will start by discussing an ideal experiment in the parameter language, which 
is easiest conceptually, and then afterwards translate it to the vector space language. 
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Let the (mixed) hyperstate before the experiment is performed be given by a 
probabiUty measure it on Let 9{-) be the target parameter of the experiment, 
assumed to be permissible. 

Suppose that the ideal experiment results in some value for the target pa- 
rameter. Let $0 be a collection of 0- values such that 6 {(f)) = Oq. Then after the 
experiment, the basic group is changed to Gq = {g : 6{g4>) = 9{4>),\/4> & ^o}, which 
is a subgroup of G by Helland (1998b), Proposition 5. The cosets gGo may be iden- 
tified with the elements g of the group G of transformation of ^-values introduced 
earlier. 

More generally, if = {4> ■ 0{(t)) = 9)} and Gq = {h : e{h(j)) = 9{^)y^ € $4, 
with 9 = g9o, then <I>e = g^Q and Go = gG^g^^ (which is the same for all g that are 
mapped into the same g £ G). The sets and also the cosets gGo will be disjoint 
for different g, but the groups Gq will overlap. The intersection of all Go will be a 
normal subgroup K oi G. 

An important observation from the above results is: 



Proposition 3. 

(a) Let the right and left Haar measure be equal. (For instance, the group could 
be compact.) Then the Haar measure v' for with respect to Go is independent of 
g, and therefore of the value of 9 . 

(b) As a consequence, with 9 as above, such that (p = if^^'P') with (p' G we 
have, v{d4>) = t'{d9)v' {d4>') after suitable normalization of the Haar measure on G, 
G and Go, respectively. 

Thus in this sense the setting for new measurements is independent of the value 
of previous measurements. This result is crucially dependent upon the assumption 
that the parameter 9{-) is permissible. 

The fact that the analysis of consecutive measurements is relatively simple for 
permissible ideal measurements, can also be related to the following observation, 
already mentioned: If 9^'^\-) and 9^"^^-) both are permissible, then the compound 
parameter 9{-) = (^(^^(•), ^(^)(-)) is also permissible. Thus results of the type above 
also apply for the compound parameter. The situation after the ^(•)-measurement 
is the same whether 9^^\-) is measured first or 0(^)(-) is measured first. A similar 
observation holds for more than two measurements. However, the probability dis- 
tribution of 9^^^ will in general not be the same whether 9^'^^ has been measured or 
not. In the ordinary language of quantum mechanics, we may have non-commuting 
variables also in this case. 
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Here is a tentative definition of a state of a system: 



Definition 8. 

A (permissible) state is specified by 

1. A permissible parametric function 6{-) giving the focus of interest, or the quan- 
tity we choose to - and are able to - get information on. 

2. A probability measure tt on the range of 6{-). A special case is the complete 
specification 9{(p) = 9i. 



Note that this definition extends the previous definition of a (hyper-)state as a 
hyperparameter (pi or a measure over In the situation we have in mind, where 
it is impossible to get full information on (j), the states of Definition 8 are of more 
interest. It is an important observation that the pure states of this kind are in 
one-to-one correspondence with 'wave functions' of the form f{<p) = f0^{6{<p)); see 
below. 

In the situation mentioned above, when both 9^^^ and 9^^^ are permissible, a state 
described by the product measure tci ® D2, where tti might be degenerate and z>2 is 
Haar measure, is considered equivalent to just the measure tti on the range of 9^^\-). 
A total non- informative state is just Haar measure on $. 

If the parameter 9{-) shall determine a state, it is often wise to try to make it as 
large as possible with respect to the ordering :<. 

Now we will try to translate this to the vector space language. This may seem like 
a rather meaningless exercise when we already have simple formulae for everything 
of interest. But the point is that we want to make a link to the formal aparatus of 
quantum theory. By doing that in the present simple situation, we will also be able 
to transform this link to more complicated situations. 

As the choice of a basic Hilbert space we take H = L^($, z/). 

We intend to focus on states, not on hyperstates. Then the relevant functions 
are those projected upon the space V as in Theorem 1. 

In particular, this means that the parameter space is changed from $ to O. Math- 
ematically, it will be important now that the underlying function 9{-) is permissible. 
It is this property which implies that is is possible to transfer a group symmetry 
from the first space to the second space. 

The conditional expectation (in parameter space) is given by the projection from 
H = L^($,z/) to the invariant space V of functions f{9{-)) (cf. Section 3). This 
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projection operator has the following mathematical description: It is the operator 
P^( ) on n defined by 

where f{6) is the Radon-Nikodym derivative of f{(p)iy{d(p) with respect to D{d9), the 
Haar measure on the image space Q of 0{-). Explicitly, f{6) = J^^ f{(f)')u'{d(f>'), since 
v{d4>) = i){dO)u' [dcf)') (assuming u'(<^q) = 1). Thus ) smooths / over each single 
set (; G 0), and is the correct operator if we want to change the parameter from 
to 9. In more statistical language is the conditional expectation of /(•), 

given ^(•), looked upon as random variables. 



Theorem 3. 

Assume Haar measure on $ such that v'^^e) = 1. Then P^(') as described above 
is a projection operator, and projects upon the space V ofH corresponding to the 
permissible parameter 9{-). 



Proof. 

Let f{(f))u{d(t)) = f{9{(t)))D{d9). Then for any integrable function c(-) on 6 we 
have 

j c{emf{<t>)<dct>) = I c{9)mu{d9). 
Use this identity on c{9) = g{9)* — f{9)* to find after some manipulation 

/ \f{ci>)-g{9mMdci>)- 1 \m-f{9mMd<i>) = I \g{9)-m\'i>{d0). 

This shows that 

/ !/(</.) -/>(0))PK#)< / {/{(!>)- g{9mMdcl>) 
for any g, so Pefi-) = f{9{-)) is the projection of / upon V. 



Recall that we consider a situation where the hyperparameter cp is unobservable, 
while certain parameters 9{-) are observable. Also recall that = f<po{9~^') for 

a fixed and = g(f)o. For = d{^o) it is therefore natural to define 
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with ) as in Theorem 3. Note that fe^ is a unit vector in L^(0,z>) when f^^ is 
a unit vector in L'^(<I>,i^). Let Ur{-) be the regular representation of G induced by 
Ur, such that UR{g)f{e) = f{g~^6). Taking fe^ = U{g)fdo for 6*1 = 56*0 in a similar 
way as in @, we get in a unique way fg^ = P^^ Vflii foi' ^1 = &{'i>i)- Thus in the 
permissible case, in this sense a description in i^) is equivalent to a description 

in L2(e,z>). 

Note that the mapping g ^ g is a, homomorphism, so 5 — > also gives a 

representation (on V) of the basic group G. 



10 Quantum operators for permissible experiment. 

In accordance with ordinary quantum mechanics, it is natural to define the density 
'matrix' p for the mixed state corresponding to a probability measure 7r(-) over as 

P = j fe-jX^dOi), i.e., p(</.,(/>') = I je.iemkm'W^idei). (7) 

This is consistent with the fact that fo-^ is a pure state vector corresponding to the 
parameter value Oi. An objection might be that these vectors are not orthogonal in 
general, so the corresponding projectors do not usually commute. Nevertheless, the 
integral is well defined and gives a trace 1 positive definite operator p as it should. 
By choosing new basis vectors that are orthogonal, it can be expressed in a more 
ordinary form. Similar constructions are made in the theory of coherent states; see 
Perelomov (1986). 

Even though there thus is a parallel to ordinary quantum theory here, and more 
parallels will turn up later, there is also a formal difference: The definition of p 
depends crucially upon the choice of the function fg^ such that /^^ = UR{gi)fg^, 
so this function must be held fixed. More precisely, for the formalism that we now 
are going to develop, one should fix once and for all a function fo{(p) = f,j)Q{(f)-, and 
then project it upon the various spaces corresponding to permissible parameters as 
described above. 

The next step is to try to find an operator corresponding to a new parameter of 
interest r] = ri{(j)). A first attempt might be to define the 'diagonal' operator by 

Here a problem arises, however: In general, this will not be well defined, again 
since the fg^ 's are non-orthogonal. However, what we can do, is to define A^i such that 

^vfdi ~ E(??(</>)|^ = Oi)fei + with fg^ee-^ = 0, where the conditional expectation 
assumes Haar measure on each $51. 
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A far more general result is given by the following theorem, which most certainly 
also can be generalized beyond the case of a compact group. 



Theorem 4. 

Let Ur be the regular representation of a compact group G on Tl = L'^{^,u). 
Fix fo^zTC and let c(-) be an integrable function on G. Assume that the projection 
fr = Prfo on each irreducible subspace Vr of TC for which J c{g)Ur{g)i'{g) 7^ 0, 
where Ur{-) is the blockdiagonal part of Uii{-) corresponding to Vr- 

Then there is an operator A on 7i, unique except on irreducible subspaces where 
fr = 0, such that 

{UR{g)fo)^AUR{g)fo = c{g) (8) 

for all g ^ G. 
Proof 

Let Ur{-) {r = 1,2,...) be the blockdiagonal irreducible parts. Then it follows 
from Theorem Al in Appendix 2 that a unique Ar can be found for each r such that 
Qr = {Ur{g)fr)^ ArUr{g)fr Satisfies J qrig)Ur{g)iy{dg) = ^ c{g)Ur{g)v{dg). Theorem 
Al was derived in a P setting. The corresponding L^-result is derived straightfor- 
wardly via the connection /(■) = v^u{-) for a basis «(•) = (tii(-), U2(-)) • • •)• 

By choosing A to be blockdiagonal with blocks A^ in the representation where Ur 
is blockdiagonal, it follows that the two sides of (^) have the same Fourier transforms, 
and hence must be equal. An application of Schur's first Lemma (Appendix 1.) shows 
that A cannot have any non-blockdiagonal contribution in this representation. Hence 
A is unique as stated. 

An obvious remark is that Theorem 4 also is valid for Ti. = L^(G,t'), i.e., when 
the basic parameter is 9. Then U{g)feo = U{g)fo = fo^ with 61 = gO^. Let rj{4>) be 
some general function of the hyperparameter. Then it follows from Theorem 4 that 
under weak conditions there exists an operator Ay^ such that 

flAje,=nrime = 9^) (9) 

for all ^1, where the expectation on the right hand side is with respect to Haar 
measure over the remaining parameters. 

For a state defined by the parameter 0{-) and a probability distribution yf (•) over 
0, hence by a density operator p, given by (|7|), we have 

j E(r?|^)^((i0) = tr(^^p). (10) 
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This gives a well known quantum mechanical formula derived in an essentially clas- 
sical setting. 

It was said above that the state vector p was only defined uniquely mudulo a fixed 
function. Looking at how the operator Q depends upon the choice of v in Theorem 
Al, we get a similar arbitrariness in the definition of Aj^ in each irreducible space. 
The result is that tr(A^/>) is uniquely defined, first proved if the space is irreducible, 
then in general. 

We will see later that the formula (10) can also be made to be valid in the non- 
permissible case, hence it is generally true in our approach to quantum mechanics. 
The formula can in general be derived from, and is hence equivalent to, a special 
case of it, where expectation here is replaced by probability. The corresponding 
quantum theory statement is considered to be an independent axiom of the theory, 
as recently pointed out by Deutsch (1999). Deutsch aims at deriving it from the 
other postulates of quantum theory, but his arguments are countered by Finkelstein 
(1999). In our approach this formula is automatically satisfied by the construction 
of the operator involved. 

The same formula holds for expressing approximate expectation of random vari- 
ables, and this variant of the formula may be potentially useful for statistical appli- 
cations, in particular when we need to approximate models with many parameters. 



Corollary 4. 

Define the approximate expectation for a random variable X, with a distribution 
depending upon when 0{(j)) = 9i as 

Ee,{X) = f E^iXyidcp'). 

Then (under the regularity condition of Theorem 4) there exists an operator Ax, 
independent of all parameters, such that 

EeAX) = flAxfe, (11) 
for all 9i; or, if there is a prior 7r(-) on B, then 

E{X) = j E^{Xy{dct>')^{d6) = iT{Axp), 

where p is given by equation It is assumed that the measures v' and tt are 

normalized to 1. 
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Let for fixed 6i, Pe^f be given by ^{f{(j))\6{(j)) = 9i}, wliere, as in Theorem 3 
and Theorem 4, the probability measure on = {0 : 0{(f)) = ^i} is given by Haar 
measure. Similarly, for C C ©, let Pq be the operator which according to Theorem 
4 is given by fl^Pffrp^ = 1 if 6'(</)i) G C, otherwise 0. 



Theorem 5. 

Assume that Haar measure is used on each ^g. Then P[0 G C] = ti (pPf) in the 
state given by and also in the hyperstate given by p = J f^^fj^^7r{d4>i). 

If f is an integrable function, one can define ^/(.) = / f{(j))dPg. Then E(f{(j))) = 
tr(p^j(.)) when the state is p. The operator ^/{.) will be hermitean if f is real, 



unitary if f{-)*f{-) = 1. 
Proof. 

We can write Pg^ = QgP^^'\ where fe^Q'^fej_ = 1 if S C, otherwise (cp. 
Theorem 4). We then have 



The proof for the hyperstate case is similar. 

The last part follows by a suitable limit argument. 



11 Non-permissible parameters; some preliminaries. 

The setting is as before: There is a hyperparameter space $ representing all (un- 
known) quantities in a given system, a group G acting on a Haar measure i/ on $ 
and the basic Hilbert space Ti = Lp'{(^,v). We will allow the hyperparameter space 
to have a rather arbitrary structure. 

The general natural procedure when 0{-) is non-permissible, is to reduce the 
group. It follows from Proposition 1 in Section 2 that this can always be done in 
such a way that 6{-) becomes permissible with respect to the new group. 

The following technical results may give some of the background needed in general 
for a proper discussion of non-permissible parameters: 



tTipP^) = tTl feJlP^iride^) 



I 
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Lemma 2. 

Let {U{g);g € G} be a unitary representation of G on a vector space V. Let Vi 
be a subspace ofV, and define Gi = {g G G : U{g)v £ V\ whenever v G 14}. Then 
G\ is a subgroup of G, and {U{g);g G Gi} is a representation of Gi on V\. 



Proof. 

It is obvious that g\g2 G G\ when g\ G G\ and (72 G G\, since U{gig2) = 
U{g\)U{g2)- Let g G Gi. Note that Vi is an invariant space for the set of unitary 
operators {U{g);g G Gi}, and so we also have that V^- is an invariant subspace. 
Suppose that U{g~^)v has a component 7^ in V-^ for some v £ Vi. Then 
U{g)~^v = vi+ V2 with vi G Vi and V2 G V-^, and f = U{g)vi + U{g)v2. This last 
equation implies U{g)v2 G Vi, which gives a contradiction to the assumption that 
V2 ^ since Vf^ is invariant. Therefore: U{g^^)v G Vi for t> G T^j and g~^ G Gi for 
g E Gi. Thus Gi is a group. 

It is clear that {U{g);g G Gi} gives a representation of Gi on Vi. 



Proposition 4. 

Let {U{g);g G G} 6e a unitary representation of G on a vector space V. Let 
dim(y) = k (possibly infinity). 

Let Wi(BW2(B ■ ■ ■ be an arbitrary decomposition ofV in orthogonal subspaces. Put 
Vj = Wi® . . . ^Wj for 1 < j < k, and let Gj = {g e G : U {g)v G Vj when v G Vj}. 
Then: 

(^aj Gi C G2 C . . . C G, and all Gj are subgroups of G. 

(b) {U{g);g G Gj} is a representation of Gj upon Vj. 

(c) If Vj is a parametric invariant space {j = 1,2,...), corresponding to the 
permissible (with respect to Gj) parametric function Oj{-), then 61 O2 :<...:< (f). 



Proof. 

Follows from Lemma 2 and Theorem 1. 



This Proposition gives us a way to handle non-permissible parametric functions 
in concrete cases: Find a subgroup Ga of G so that the parameter is permissible 
with respect to Ga- This is always possible, and in many cases it is easy to find such 
a Ga- In general we can let Ga be the maximal group such that 6a{-) is permissible 
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with respect to Ga] cf. Proposition 1. The procedure is to do all calculations 
as if this was the relevant group. Note first that v also is a Haar measure with 
respect to Ga- Let Ua be the regular representation of Ga on L^(<I>, i/), and let Va be 
the parametric invariant space with respect to Ga corresponding to the parameter 
Oa{ )- Use Theorems 3, 4 and 5 with the reduced group. Using the construction of 
Proposition 4 on {Va}-, and letting {U{g)} there be the regular representation, it is 
seen that the two definitions of Ga coincide. 

The definition of a state (Definition 8) can also be generalized to the non- 
permissible case. Thus we redefine a state to be any parameter Oa{-) together with 
a fixed value for this parameter (pure state) or a probability measure (mixed state). 
A final presision is given by the following: To define a pure state in this way, we 
require that Oa{ ) is so large with repect to the ordering ^ that the corresponding 
space Va contains the state space M.. 

The result of all this may be combined with what is found by using the original 
group G. 

We need a generalization of Theorem 4, and formulate some consequences of this 
generalization. 

Theorem 6. 

Let V be any subspace of Ti = L?'{^,v) which is invariant under the regular 
representation Ur of G. Let fo &V have non-zero projection upon every irreducible 
invariant subspace Vs of V . Let c{g) be any v-integrable function which can be 
expanded in terms of the matrix elements of the irreducible representations: 

c{g) = J2as,,jUs,ij{g'^). (12) 

Assume as,ij = for all s such that Vsr\V = {0}. Then there is a unique operator 
A on V such that 

{UR{g)fo)^AUR{g)fo = c{g). 

Proof 

The irreducible representation matrices satisfy the orthogonality relations (Hamer- 
mesh, 1962) 

dr J Us,il{g^^)Ur,jm{g)^[dg) = SrsSijSlm- 

The relation ([T^ ) therefore implies 

J c{g)Ur,jmig)'^idg) = drarjm, 
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which vanishes when VrDV = {0}. Therefore A can be defined as an operator on 
V. 



It is now time to make an important simphfication. Recall Remark 1 after The- 
orem 1, where it was said that for teh purpose of accurate inference it is usually 
wise if possible to make the statistical model narrow in the sense that the parameter 
6{-) should be small with respect to the ordering ^. Also recall the definition 6 of 
a minimal model. In the following , we will assume such a minimal model, i.e., the 
basic space ^A will be assumed to be irreducible with respect to the group G. 

The following theorem will be fundamental for the further development of teh 
theory. 

Theorem 7. 

Let A4 be an irreducible invariant subspace of TC = L^(<I>,z^) under the compact, 
transitive and exact group G. Fix 0o ^ o-nd let fQ^Obea fixed element of M. 
Define f^^ = UR{g)fo when 0i = gcpo- Let 9{-) = u{-) = {ui{-),U2{-), ■ ■ ■), a basis 
for A4, and let q be a v-integrable function on the range G of 9{-). Then there is a 
unique operator A on M such that 

flAUi = Qi^iM) for all (/.I G 

Proof. 

Since (p is assumed to be transitive and exact under G, there corresponds a unique 
g for each (pi such that cpi = gcpQ. We have to prove that c defined by c{g) = q{0{g(po)) 
satisfies the equation (^) for fixed s corresponding to the ireducible space A4. But 
by Remark 2 following Theorem 1, the set of functions c satisfying (|l^) is just the 
set of elements in the parametric invariant space corresponding to the permissible 
parametric funtcion u{-) (cf. also Theorem 1(b)). 



12 A possible starting point for a conceptually simpler 
quantum theory. 

We have now collected enough technical tools to develop the relation between what 
seems to be an intuitively simple theory for quantum phenomena, based on symmetry 
and statistical modelling concepts, and the traditional one, based on an abstract 
Hilbert space formulation. 
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For clarity we repeat the basic assumptions made: We let $ be some general 
hyperparameter space, for which nothing is assumed in principle, but compactness 
was assumed for some of the technical results above. Let G be a group acting on $. 
A parameter 6{-) on $ is called permissible if ^(^i) = 0{4>2) implies 0{g(pi) = 9{g(p2)- 
Any permissible parametric function induces a group G on the image space G, i.e., 
^W))=%<^). 

We will assume here that the group G is transitive; superselection rules can be 
tackled by taking orbits separately. 

Any statistical experiment a can be modelled by a class of probability measures 
indexed by some parameter 9a- Wc will assume here that there is some wellde- 
fined setting where one has the choice between doing any one in a fixed class of 
experiments. We can then always assume that each 9a is a function on the fixed 
hyperparameter space 

A basic assumption of the theory is that the hyperparameter (p can not be esti- 
mated. On the other hand, it is assumed that each parameter 9a can be estimated 
accurately from an ideal experiment. 

This framework is in fact rich enough to give a very interesting theory for ex- 
periments. The theory for each single experiment is contained in existing statistical 
theory. The theory of a collection of possible experiments like this, seems to be new. 

In addition to the assumptions above, it may be convenient to assume, as in 
Definition 3, that (f) in principle can be recovered from all the potential values 9a{(t>)-, 
and, as in Definition 4, that the basic group G is contained in the smallest group 
containing all Ga- However, neither of these assumptions arc really essential. 

For the technical results in this paper, we have made the further rather strong 
asumption that G, in addition to being transitive, is exact and compact. These 
assumptions can certainly be partly removed by going deeper into the mathematics; 
a start of that process was made in Hclland (1998b), using among other things results 
of Varadarajan (1985). However, the point here is mainly to show that a connection 
can be made, at least in the simplest case. Note that the new theory in itself does 
not need these strong assumptions to make sense, but assumptions are needed to do 
calculations and to recover the ordinary quantum formulations. 

The link to ordinary quantum mechanics is found by first considering vectors in 
v. = L^(<I>,i/), where u is Haar measure with respect to G. Invariant subspaces of 
H are spaces that are closed under the operations /(•) — > f{g-)- In particular, the 
subspace of functions f{9{-)) is invariant when 9{-) is permissible; such a subspace 
is called parametric invariant. 

At least when G is compact, then by group representation theory, each invariant 
space can be decomposed into irreducible invariant spaces. In the following, we will 
fix one such irreducible invariant space Ai (under G), and take that as our basic 
Hilbert space. In the compact case, M. will be finite-dimensional. 
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We will now assume that each Oa{-) is so large under the ordering ^ that the 
space Va = {f : f{(p) = /(^a (</'))} contains M.. The space M. then consists of a set 
of functions q{6a{(p)) for a linear space Qa of functions q on 6^. At the same time it 
contains all linear combinations of the basis functions u{-) = {ui{-),U2{-), • • •) of M. 
Prom the conclusion of Theorem 7 we get 



Corollary 5. 

Assume that G is compact, exact and transitive. Then for all integrable functions 
q G Qa there exists a unique operator A^^ on A4 such that 

flAlf^, = q{Oa{cPi)) (13) 

for all e^. 

Proof. This follows siiicG ^(^a(*)) ^ -A/l, and. Iigiicc is a linecir function of tliG bcisis 

Note that we here only have used Theorem 7 for linear functions of n(-), not the 
full potential of this basic result. 

Without loss of generality we assume that Qa contains the identity function; this 
can always be achieved by redefining Oai')- 



Definition 9. 

The A-spectrum of the parametric function 9a{-) is the set of values 9i of 9a{ ) 
for which one can find a function f ^ M. such that Aaf = 9if , where Aa is the 
operator of with q being the identity. 



Since Aa is a bounded operator on a finite-dimensional space, it has a finite 
spectrum {9i} and a corresponding orthonormal set of eigenfunctions {/«} spanning 
M. 
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Theorem 8. 

The A-spectrum of 9a{-) is equal to its spectrum as defined in Definition 7. In 
particular, it will always be a 0- spectrum, so A4, when correspondingly restricted will 
be a Ga-invariant parametric subspace of the restricted Ti. 

One consequence (Proposition 2a) is that the spectrum is concerved under trans- 
formations Qa ^ Ga- 

Proof. 

We will first show that the A-spectrum is a 0-spectrum. 

Assume first that the A-spectrum is non-degenerate. Let p{-) G Va, so that by 
what has been assumed, p(<^) = q{Oa{(p)) for some q. Since G is transitive, the 
irreducible representation D{-) is transitive on M. Fixing fg^ as one eigenvectors 
of A a, we can therefore find (pi — gi(f>o such that /^^ are the other eigenvectors of 
Aa- Since then Aaf^- = 9if^^, it is clear that (|l^ ) holds with (pi replaced by (pi 
and A^ = q{Aa). Since this can be done for any q, the set Qa when restricted 
to the spectrum is the set of all functions, so Va restricted to the spectrum equals 

restricted to the spectrum. Since the number of eigenvalues of Aa equals the 
dimension of M, the spectrum is maximal in this respect, so by definition it is a 
0-spectrum. 

The degenerate case can be handled by a limiting argument. 

We will now show that among all 0-spectra, the A-spectrum minimizes unneces- 
sary variance. First we remark that Yarr]{(p) = E(Var(r/((/))|0)) +Ya.r(E(ri(<p)\9)), so 
this is equivalent to the rquirement that the last term, or more generally \ar (k{6)) 
for an arbitrary function k is maximized for the A-spectrum. 

Since any 0-spectrum leads to a basis for an invariant space (under Ga), the basis 
vectors fi will be orthonormal, so fifj = I. Let now k{6) be a function on Q^i 
and let k{Aa), respectively k2{Aa) be the operators such that fjk{Aa)fi = k{Oi) and 
fjk2{Aa)fi = k{6iy, respectively. By Schwarz' inequality 

f^kiAaffi>ki9,f = fJk2{Aa)fi 

with equality if and only if the /j's are eigenvectors for k{Aa), hence Aa (a slightly 
better precision on the choice of k could have been formulated here, but that is trivial 
and rather uninteresting). Therefore by a straightforward calculation the variance is 
maximized when the 0-spectrum is an A-spectrum. It is known that the A-spectrum 
is unique, so that means that the spectrum as defined in Definition 7 is unique. 
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13 The connection. 



We have defined a state as the specification of the value 6i of a parametric function 
Oa{-)- But by what has been said above, this is equivalent to specifying a 'wave 
function' f{<j)) = /6»i(^a(0))) which is an eigenfunction of the corresponding opera- 
tor Aa- In this respect the correspondence is complete. 'Wave functions' that are 
not eigenfunctions of some parametric function/ operator are hardly of interest in 
quantum mechanics. 

Most of what has been said so far, except the concentration on the irreducible, 
invariant space A4, might be transferable to many experimental situations. We will 
now make three additional assumptions which lead to the specific formalism met in 
quantum theory. 

First, we have already assumed that all observed values from an ideal measure- 
ment of a parameter 9ai-) belong to its spectrum. From what has just been shown, 
this seems a very plausible assumption from a purely statistical point of view: It 
just serves to concentrate values in such a way that A4 will serve as a parametric 
invariant space for 9a{-)- In quantum mechanics one wants to go a step further: Only 
values in the spectrum have meaning. 

Secondly, we make the reversibility assumption 

P[0a{-) = OiM-) = e'2] = PM-) = e'M) = Oi] 

for all choices 9a{-),0h{-),9i,92- 

Thirdly, we assume the following: For the pair of parameters Oa{-) and Oi,{-) we 
can find orthonormal bases {fg, } and {fni } that the state space A4 is irreducible 

under the subgroup of G generated by permutations among the 6i and among the 

These assumptions are enough to recover the familiar transition formulae of quan- 
tum theory. 

Theorem 9. 

(a) We have 

where 9i is in the spectrum of 9a{), ^2 ^■^ spectrum of9h{-), fg^i') = fe^i^ai-)) 

is the eigenfunction corresponding to 9i and f^, (•) = fg, {9h{-)) is the eigenfunction 
corresponding to 

(b) In ^T^ ) let q{9i) = E{9b{4')\9a{(p) = 9i). Then the unique operator of the 
equation has the form Pg^Ai,Pg_^ where Pg_^ projects upon the space corresponding to 



38 



9i and Ai, is the same for every choice of parameter Oa{-)- Hence 

Ei9,i^)\9aicP) = 9^) = ff^A,fl (14) 
holds for all pairs of parametric functions. 



Proof. 



(a) In a simpler notation we have to prove the following (again using (13)): 
Let uo,ui, ... be an orthonormal basis of some vector space, and let vo,vi, . . . be 
another orthonormal basis of the same space. Assume that there are operators, each 
set summing to /, such that for all i,j we have 

ulPjUi = vjQiVj 

(; this is the reversibility condition). Then this should imply that the only solution 
is Pj = Vjvj, Qi = Uiu\. 

It is clear that this is a solution; the problem is to prove that it is the only one. 
Assume that there is a second solution = Pj + Ej , Q* = Qi + Fi. Then 

u\EjUi = vjFiVj = 

for all i,j. But this implies that for all members g of the group generated by 
permutations of the two set of indices we have 

ulu{g)^EoU{g)uo = 0, 

similarly for the other equation. By assumption 3 and Theorem Al, this implies 
Eq = 0, similarly Ej =0. 

(b) By (a), we can take Ab = Y.j ^'jfe'jl in (0)- This means that P^^AbPQ^ is 
a solution in ([l3D; by Theorem Al it is the only solution. 



Some of the familiar consequences of (|T^ are: 

1. If Ajj can be made diagonal with respect to some countable orthogonal basis 
system, then the conditional expectation of 9b{<p), and of any function of 6'fe((/>) 
can be given as if the probability of some eigenvalue was the squared norm of 
the component of the state vector along the corresponding eigenvector. 
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2. If fg^ is an eigenvector for Af, corresponding to eigenvalue A, then the condi- 
tional expectation of this state is A, and the conditional variance is 0, soO^, = X 
almost surely in the state fg^. This must be taken quite literally, since all we 
know in this state is the value 9i ; any knowledge of the underlying cp is assumed 
to be absolutely impossible. 

Note: All this theory has been built upon a one-to-one correspondence between 
parameters 9{-) and values 9i of such functions on the one hand and Hilbert state 
vectors feiiSi')) on the other hand. This means that a pure state is simply a pair 
{9{-), 9i). If knowledge of the hyperparameter cpi was a possibility, then from a quan- 
tummechanical point of view a rather trivial theory would result. On the other hand, 
it is this situation which is closest to practically all current statistical applications. 
One can easily imagine large scale situation, too, where the notion of complementar- 
ity is relevant. We are quite convinced that the way of thinking here, using model 
groups and related model reductions may be useful also in statistical applications, 
but the apparatus that will be needed for this remains to be developed. 

In quantum mechanical applications, the parameters 9{-) seem to be very close 
to what Bohr and Ulfbeck (1995) call symmetry variables, while saying that these 
variables constitute the elements of physical reality. 



14 A simple large-scale 'quantum mechanical' example. 

Since we have shown that our approach gives a formalism that is essentially equiv- 
alent to the ordinary quantummechanically formalism, any standard example from 
quantum theory could have been used to illustrate our theory. Instead we will give a 
large scale example, using a group which is as simple as possible. More examples of 
direct physical relevance, using a related symmetry approach, can be found in Bohr 
and Ulfbeck (1995). 

One of the simplest non-commutative groups is the group S3 of permutations 
of 3 objects. Visually, it is the group of possible positions of a solid triange under 
the constraint that the corners should touch 3 fixed points on a table, say. It has a 
two-dimensional representation discussed in many books in group theory and several 
books in quantum theory. The quantum theory book by Wolbarst (1977) is largely 
based on this group as a pedagodical example. The famous controverses around 
Marilyn vos Savant's example with doors, goats and a car (see Morgan et al., 1991, 
and references there) can also be coupled to the non-commutativeness of the group 
S3. 

We will identify the hyperparameter space $ with the 6 elements of G = ^3: 
1=ABC, 2=CAB, 3=BCA, 4=ACB, 5=CBA and 6=BAC, where we can picture the 
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letters as marking the corners of the triangle, read counter-clockwise with the first 
letter at the lower lefthand corner. We still think of a solid, physical triangle, and we 
may assume that it is painted white on one side, say, corresponding to the rotation 
permutations 1, 2 and 3, and black on the other side, corresponding to the reflections 
4, 5 and 6. 

In this case the two-dimensional representation of the group is not only a homo- 
morphism, but an isomorphism (Wolbarst, 1977). Hence the permissible parameter 
corresponding to the two-dimensional representation (cp. Theorem 1) is just the 
identity in this case. There are 4 parametric functions of interest: Oa{-), Ob{-), ^ci'), 
whose values are the upper case letter at the corners a, b and c, respectively, and 
Oo{-), whose value is the colour of the triangle. 



Proposition 4. 

Oo{-) is a permissible parametric function, while Oa{-), Oi,{-) and 9c{-) each are 
non-permissible. 

Proof. 

Let the group element which takes the ground position 1 to the position i as listed 
in the hyperparameter list above, be called gi {i = 1, . . . ,6). Assume ^o(^^i) = ^0(^2), 
say black, for two hyperparameter values 01 and (j)2 = 1,2, ... ,6. Then by simple 
inspection 0o(,9(<a>i) = Oo{gi(t^2) = 'black' for i = 1,2,3 and 'white' for i = 4,5,6. 
Hence 60 is permissible. 

For the other functions it is enough to produce a counterexample. Here is one for 
ea-. We have ^^(l) = 0„(4) = A, but ^^(^sl) = ^a(5) = C and Oaig^^) = 0a(3) = B. 



The maximal reduced group corresponding to 9a will consist of the identity to- 
gether with a single reflection: Ga = {91,94}, which is transitive on = {1,4}, the 
set of elements with 6a = A. With respect to this group the parameter 9a will be 
permissible. The results for 9^ and 9c are similar. 

So far the terms 'parameter' and 'hyperparameter' have only been words, but now 
we intend to construct a large scale quantum system based on this solid triangle. 
To this end, place the triangle within a hollow nontransparent sphere in such a way 
that it can rotate freely around its center point, placed at the centre of the sphere. 
Let there first be 4 windows in the sphere, one at the north pole, where the colour 
facing up can be observed, and three equidistant windows along the equator, where 



41 



single corners of the triangle can be observed. We could also let the observations in 
each window be uncertain for some reason, but for simplicity we will again stick to 
ideal measurements in each window. 

To make this correspond as well as possible to a quantummechanical system, we 
have to make some perfections on the sphere with its windows and solid triangle 
inside. We assume that we have some mechanisms to the following ends: 

Before any measurements are made, the triangle rotates completely freely around 
its center point. Once an equatorial window is opened, there is some mechanism 
which draws the nearest corner to this window. After the measurement at this 
window is done, this particular corner is fixed, and the triangle rotates around the 
implied axis until a new equatorial measurement is done. The second measurement 
implies that the hold in the first corner loosens. 

Once the top window is opened, the triangle flattens out to (the nearest) horison- 
tal position, and it continues to rotate in this position until a new measurement is 
done. 

We assume that the windows are ideal in the sense that they never provide more 
information than they are supposed to. We finally assume that it is impossible to 
open the sphere. 

Note that each 'observable' 9a, 0^, 6^ has three different values. In the state where 
9a = A has been observed, each of the observables and 9c then has a probability 
of 1/2 on each of the values B and C. If a ^o-measurement has been done, the next 
observation in an equatorial window will be essentially uneffected. Therefore the 
state of the system after two such observations, one in the top window and one in 
the horisontal window, is completely determined. 

Note also the following feature related to the general theory: If only observation 
of the permissible observator is allowed, this will be a rather trivial system, where 
every future observation is fixed after just one measTircmcnt 

A 'true' quantum mechanical system will first be obtained if we close and seal the 
top window. Then the triangle is never at rest. Each measurement puts the system 
into a new state. In particular, the question of retrieving the 'true' position of 
the triangle, from a measurement, is out of question. 

Standard quantummechanical calculations can be carried out for this example, 
e.g., by choosing a three-dimensional basis where 9a{-) is diagonal in the sense that 
it can be associated with a diagonal 3x3 matrix Aq^ . In this basis neither 9i, nor 9c 
are diagonal. However, ordinary probability calculations are of course much easier 
to carry out in this case. 



42 



15 Concluding remarks. 



In this paper we have confined the discussion to the parameter space of the exper- 
iment in question. Taking up again the idea from Section 2 of a set of potential 
experiments {Xa,^a,{Pa'i^ ^ &a}',ct G A}, a related discussion would result if we 
introduce a symmetry group on A. This discussion seems to be related to random- 
ization theory, where group representations and invariant spaces have already proved 
to be of fundamental value (Bailey, 1991; Bailey and Rowley, 1990). An interesting 
question is if it is possible to combine these two approaches. 

It is of course too early to say anything final about the value of the mathematical 
results discussed in this paper. However, personally, I feel that there is a possibility 
that results of this kind can help starting a process which eventually may shed new 
light upon several aspects in the interpretation of quantum physics and also perhaps 
upon more general issues related to modelling, observing and interpreting nature. If I 
am right in these assumptions, there seems to be room for and need for much further 
work in several directions, and I think that some of this work may call for cooperation 
between several disciplines, physicists, mathematicians, statisticians, philosophers, 
and perhaps also other specialists. 

At least on an intuitive level, I feel that there are less philosophical difficulties 
implied if one should be able to develop a complete parameter interpretation of 
quantum physics than what is implied by the conventional interpretation. Issues like 
Scrodinger's cat become less problematic when everything is governed by models with 
hypothetical parameters instead of formalisms involving states that are assumed to 
be 'real' in some sense. Similar remarks may help to explain why the Kochen-Specher 
theorem (Isham, 1995) does not seem to represent a problem here. 

However, the discussion around hidden variables is very relevant to the model 
discussed in this paper. If the present models shall describe something like the real 
quantum world, then it must be impossible by any means to get full information 
about (/>!. Only certain derived parameters can be estimated. One question then 
might be if models involving action at a distance on the hypcrparamctcrs can be 
permitted, since noe information is ever retrieved about these parameters. The 
discussion of Section 2 is relevant here. 

Also, several other problems do exist, for instance: What models should be 
allowed in quantum physics? What models are reasonable? Arc there ways to tackle 
ambiguity in the model choice? Again there arc many details to be worked out, but it 
is quite obvious that symmetry considerations must be very important in governing 
model choice in this field. 

In this paper I have not said anything about time development and the Hamil- 
tonian. The issue was briefly touched upon in Holland (1998a). 

Modelling involves information. It would be very interesting if some connections 



43 



could be found to classical themes like Maxwell's demon, to blackbody-radiation or 
to thermodynamics in general. 

At first sigiit the models of this paper may seem too simple to explain the com- 
plicated world of quantum physics. A natural question, for instance, is: Where do 
we find the superposition of state vectors in these models? In fact, we do find it, 
for instance in equations of the type Pei/0i(<?^) = foii^i^t^)) used above, or more 
generally, whenever focus is changed from one parameter to another. 

Griffiths' (1984) history approach is an important issue in current development of 
quantum physics. Closer connections here would definitely be of interest to study. 
Of special interest may be the connection between decoherence in quantum physics 
and randomization and other ways of deliberately ignoring information in statistics. 

A theme that is not touched upon at all here, is generalization to relativistic 
systems. There are undoubtedly problems here demanding the skills of both physi- 
cists and mathematicians. But knowing that large parts of high energy physics is 
based upon group theory, even the most superficial thinking around this may point 
at possibilities for some kind of parametric modelling based on symmetry. 

Calculation is another problem area that seems to lead to new possibilities if 
the theories in this paper are accepted. In some future one can think of both sta- 
tistical techniques (Markov Chain Monte Carlo for instance) that may turn out to 
be useful in quantummechanical calculations, or one can investigate if in the large 
resource of highly developed quantummechanical calculating techniques there might 
be something that is useful for instance in Bayesian statistics. 

Finally, again if the ideas here are accepted, there seems to be room for thinking 
through once again some concepts in statistical theory. But that is not a theme here. 

As repeatedly said above, all these remarks assume that the main idea of this 
paper should be accepted, namely that there is a possibility of a common foundation, 
at least partly, for the two areas which today seem so far apart: Theoretical statis- 
tics and quantum theory. My own feeling is definitely that there seems to be such 
a possibility, and that the technical arguments in the main body of this paper may 
give some of the tools needed to find that possibility. Another very recent develop- 
ment which seems to point in the same direction, is the Fisher information approach 
(Frieden, 1995; see also Matthews, 1999) treating among other things the relation 
between Cramer-Rao's inequality in statistics and Heisenberg's inequality in quan- 
tum physics. The book by Prieden discusses largely problems that have not been 
touched upon here, however; on the other hand, no discussion of quantum theory is 
given there except for the treatment of the Schrodinger equation and its relativistic 
counterparts. 
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Appendix 1: Some group representation theory for 
compact groups. 

In general, a matrix representation of a group is defined as a function D from 
the group to the set of (here complex) matrices satisfying D{gh) = D{g)D{h) for 
all g and h. Any representation D and any fixed nonsingular matrix K of the same 
size can be used to construct another representation S{g) = KD(g)K^^ . If the 
group is compact (and also in some other cases), we can always find such an S of 
minimal block diagonal form, and at the same time we can take S to be unitary 
{S{g)^S{g) = I). If (and only if) the group is Abelian, each minimal block will be 
one-dimensional. 

An important aspect of this reduction appears if we look upon the matrices as 
operators on a vector space: Then each collection of blocks gives an invariant vector 
space, and each single minimal block gives an irreducible invariant vector space. For 
compact groups, the irreducible invariant vector spaces will be finite-dimensional. 
The minimal matrices in the blocks are called irreducible representations of the 
group. 

More generally, a class of operators {U{g);g G G} (where G is a group) on a, 
possibly infinitedimcnsional, vector space is a representation if U{gh) = U{g)U{h) 
for all g, h. A representation of a compact group has always a complete reduction in 



46 



minimal matrix representations as described above. (The theory of general groups 

is more complicated.) In particular, this holds for the unitary regular representation 
defined on a Hilbcrt space Ti. = L^{^,u) by UR{g)f{<j)) = f{g^^4')- Here u is the 
right Haar measure for G on $ (defined by y{Bg) = i'{B) for all B,g). 
Two useful results are Schur's lemmas: 

(1) If I? and D' are irreducible representations of different dimensions, and A is 
such that D{g)A = AD'{g) for all g, then necessarily, A = 0. 

(2) If D and D' are irreducible representations of the same dimension, and A is 
such that D{g)A = AD'{g) for all g, then either D and D' are isomorphic or ^ = 0. 
If D{g)A = AD(g) for all g, then necessarily A = XI for some scalar A. 

The character of a group representation is defined as x(fl') = tr(L'((7)). The 
characters of the minimal blocks Dr{g) are the irreducible characters Xr{9)- These 
satisfy the orthogonality relations 

j Xr{g)*Xr'{9)^idg) = Srr', 

where u is the Haar measure of the group (which can be uniquely defined as a 
probability measure for compact groups). 

Let now V (^Ti. = L^($, i^) be an invariant vector space of the regular represen- 
tation UR{g) of a compact group. The orthogonal projection upon the irreducible 
part of corresponding to Ur{g) (dimension dr) can then be computed as 

Pr = dr j Xr{gTUR{g)v{dg). 

A Fourier transform corresponding to the representation UR^g) can be defined 
by taking 

f{Ur) = j f{g)Ur{g)u{dg), 
for each irreducible subrepresentation, and it can be inverted by 

f{g) = Y,dMUr{g-^)f{Ur)). 

r 

More on group representations can be found in Diaconis (1988), Hamermesh 
(1962), James and Liebeck (1993), Serre (1977) and Wolbarst (1977). 

Any regular representatation Ur on an L^-space can equivalently be defined on 
P by letting u{-) = (ni(-), 'U2(-), . . .) be a basis for and taking {UR{g)v)^u{-) = 
v'^UR{g)^u{-) = v^u{g-). 
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Appendix 2: The construction of an operator. 

The following result is the basis for proving Theorem 4 and the theory in Sections 
11 and 12: 

Theorem Al. 

Let [/(•) be a unitary irreducible representation of a compact group G on a vector 
space V , let V £ V {v ^ 0), and let c(-) be a function on G which is integrable with 
respect to Haar measure v for G. Then there exists a unique linear operator Q on 
V such that c{g) and 

qig)=v^U{g)^QU{g)v (15) 
have the same Fourier component corresponding to this space: 

I c{g)U{g)v{g) = J q{g)U{gHg). 

Proof. 

With Haar measure u, the Fourier transform of c(-) is 

c{U) = J c{g)U{g)u{dg). 

We want to prove 

c{U) =v^ I U{g)^QU{g)vU{g)u{dg). 

This is equivalent to 

c{U)ki = i^Q J U{gWu{g)^U{g)kiy{dg); 

hence 

c{U) = Y^QijDij, (16) 
where, for each (i,j) the operator Dij is defined by 

D^J = J {U{9)v)UU{9)y)fi{9>{dg). 

We want to show that this equation has a unique solution. For compact groups, 
every irreducible representation is finite. Hence, taking components and using double 
indices, (|l^ ) is a matrix equation of the form Tx = c, which has a unique solution 
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for all c iff the adjoint equation T^x = c has a unique solution for all c, and this 
happens if and only if T^^y = implies y = 0. 

Assume then that (^) does not have a unique solution. Then there exists a 
nontrivial set of constants {cki} such that 

/ r{g)U{g)vvm{g)^u{dg) = for r{g) = ^ CMUkl{g). (17) 

k,l 

From (^) for a fixed v V it follows that 

J r{g)U{g)ab^U{g)^u{dg) = 0, (18) 

first for a = U{ga)v, b = U{gb)v by multiplying ( p^ by suitable f7-matrices on both 
sides, then for linear combinations of such terms, and hence for all a,b £ V, since 
these linear combinations span V by Theorem 2(b). 

Then, by letting a and b be basis vectors in V and multiplying both sides by 
another set of basis vectors, it follows that 

J r{g)U,j{g)Uui{g)v{dg)=Q 

for all i,j,k,l. We now use the fact that the Fourier transform can be uniquely 
inverted. This first gives r{g)Uij{g) = for all from which it follows that 
r{g) = 0. The irreducible representations of a compact group satisfy orthogonality 
relations of the form d J Uki{g)Umnig)i^{dg) = Sim^kn (cf. Hamermesh, 1962). Hence 
it follows from the definition ([TtI) that Cij = 0. This proves the existence and the 
uniqueness of the solution of (|15|) for integrable c(-). 
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